Automatic Segmentation of Continuous Speech on Word and Phrase Level based on Suprasegmental Features

نویسندگان

  • Klára Vicsi
  • György Szaszák
چکیده

This article investigates whether it is possible to segment continuous speech on word and phrasal level by examination of suprasegmental parameters, in case of bound stress languages like Hungarian and Finnish. The final aim is to increase the robustness of speech recognition on language modelling level by the detection of word and phrase boundaries and so we can significantly decrease the searching space during the decoding process. In Hungarian language if stress is present, it is always on the first syllable of the word stressed. Thus if stressed syllables can be detected, word boundaries are also detectable with reliable accuracy. We have developed different algorithms based either on rule based or on data-driven approach. Rule based algorithms and HMM based method are compared. The best results were obtained by data-driven algorithms using the time series of fundamental frequency and energy together. Syllable length was found much less effective hence it was discarded. By using of suprasegmental features word boundaries can be marked by a good accuracy, if we allow not to find all of them. The method we evaluated is easily adaptable to other fixed stress languages. To investigate this we adapted our datadriven method to Finnish language and got similar results.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Computer Assisted Pronunciation Teaching (CAPT) and Pedagogy: Improving EFL learners’ Pronunciation Using Clear Pronunciation 2 Software

This study examined the impact of Clear Pronunciation 2 software on teaching English suprasegmental features, focusing on stress, rhythm and intonation. In particular, the software covers five topics in relation to suprasegmental features including consonant cluster, word stress, connected speech, sentence stress and intonation. Seven Iranian EFL learners participated in this study. The study l...

متن کامل

Automatic Segmentation of Continuous Speech on Word Level Based on Supra-segmental Features

This article presents a cross-lingual study for Hungarian and Finnish about the segmentation of continuous speech on word and phrasal level by examination of supra-segmental parameters. A word level segmentationer has been developed which can indicate the word boundaries with acceptable precision for both languages. The ultimate aim is to increase the robustness of speech recognition on the lan...

متن کامل

A Systematic Review of Hindi Prosody

Prosody describes both form and function of a sentence using the suprasegmental features of speech. Prosody phenomena are explored in the domain of higher phonological constituents such as word, phonological phrase and intonational phrase. The study of prosody at the word level is called word prosody and above word level is called sentence prosody. Word Prosody describes stress pattern by compa...

متن کامل

Word segmentation in Persian continuous speech using F0 contour

Word segmentation in continuous speech is a complex cognitive process. Previous research on spoken word segmentation has revealed that in fixed-stress languages, listeners use acoustic cues to stress to de-segment speech into words. It has been further assumed that stress in non-final or non-initial position hinders the demarcative function of this prosodic factor. In Persian, stress is retract...

متن کامل

Speech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers

In spite of decades of research, Automatic Speech Recognition (ASR) is far from reaching the goal of performance close to Human Speech Recognition (HSR). One of the reasons for unsatisfactory performance of the state-of-the-art ASR systems, that are based largely on Hidden Markov Models (HMMs), is the inferior acoustic modeling of low level or phonetic level linguistic information in the speech...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005